Search for: All records

Creators/Authors contains: "Rus, Vasile"

Note: Clicking a Digital Object Identifier (DOI) link takes you to an external site maintained by the publisher. Some full-text articles may not be available free of charge during the embargo (administrative interval).

  1. Assessing student responses is a critical task in adaptive educational systems. More specifically, automatically evaluating students' self-explanations contributes to understanding their knowledge state, which is needed for personalized instruction, the crux of adaptive educational systems. To facilitate the development of Artificial Intelligence (AI) and Machine Learning models for automated assessment of learners' self-explanations, annotated datasets are essential. In response to this need, we developed the SelfCode2.0 corpus, which consists of 3,019 pairs of student and expert explanations of Java code snippets, each annotated with expert-provided semantic similarity, correctness, and completeness scores. Alongside the dataset, we also report performance results obtained with several baseline models based on TF-IDF and Sentence-BERT vector representations. This work aims to enhance the effectiveness of automated assessment tools in programming education and to contribute to better understanding and supporting student learning of programming. 
    Free, publicly-accessible full text available May 14, 2026
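
    As a rough illustration of the kind of TF-IDF and Sentence-BERT baselines mentioned in the abstract above, the Python sketch below scores a student explanation against an expert one with cosine similarity. The example texts and the SBERT checkpoint are assumptions for illustration, not the paper's exact configuration.

      # Minimal similarity-baseline sketch (illustrative; not the SelfCode2.0 setup).
      from sklearn.feature_extraction.text import TfidfVectorizer
      from sklearn.metrics.pairwise import cosine_similarity
      from sentence_transformers import SentenceTransformer, util

      student = "The loop iterates over the array and sums every element."
      expert = "The for loop traverses the array, adding each element to a running total."

      # TF-IDF baseline: vectorize the pair and take the cosine of the two vectors.
      tfidf = TfidfVectorizer().fit([student, expert])
      vecs = tfidf.transform([student, expert])
      tfidf_score = cosine_similarity(vecs[0], vecs[1])[0, 0]

      # Sentence-BERT baseline: encode both explanations and compare embeddings.
      sbert = SentenceTransformer("all-MiniLM-L6-v2")  # assumed checkpoint
      emb = sbert.encode([student, expert], convert_to_tensor=True)
      sbert_score = util.cos_sim(emb[0], emb[1]).item()

      print(f"TF-IDF similarity:        {tfidf_score:.3f}")
      print(f"Sentence-BERT similarity: {sbert_score:.3f}")

    In a corpus such as the one described above, scores like these would be evaluated against the expert-provided similarity, correctness, and completeness annotations rather than used on their own.
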
  2. Free, publicly-accessible full text available March 3, 2026
  3. Free, publicly-accessible full text available January 1, 2026
  4. This paper presents a comparison of two instructional strategies meant to help learners better comprehend code and learn programming concepts: reading code examples annotated with expert explanations (worked-out examples) versus scaffolded self-explanation of code examples using an automated system (an Intelligent Tutoring System). A randomized controlled trial was conducted with 90 university students assigned to either the control group (reading worked-out examples, a passive strategy) or the experimental group, in which participants were asked to self-explain and, if needed, received help in the form of questions from the tutoring system (scaffolded self-explanation, an interactive strategy). We found that students with low prior knowledge in the experimental condition had significantly higher learning gains than students with high prior knowledge; in the control condition, this difference in learning outcomes based on prior knowledge was not observed. We also analyzed the effect of self-efficacy on learning gains and on the nature of self-explanation. Low self-efficacy students learned almost twice as much in the interactive condition as in the passive condition, although the difference was not statistically significant, likely because of the small sample size. We also found that high self-efficacy students tended to provide more relational explanations, whereas low self-efficacy students provided more multi-structural, or line-by-line, explanations. 
  5. This paper systematically investigates the generation of code explanations by Large Language Models (LLMs) for code examples commonly encountered in introductory programming courses. Our findings reveal significant variations in the nature of code explanations produced by LLMs, influenced by factors such as the wording of the prompt, the specific code examples under consideration, the programming language involved, the temperature parameter, and the version of the LLM. However, a consistent pattern emerges for Java and Python, where explanations exhibit a Flesch-Kincaid readability level of approximately grade 7-8 and a consistent lexical density, i.e., the proportion of meaningful words relative to the total size of the explanation. Additionally, the generated explanations consistently achieve high scores for correctness, but lower scores on three other metrics: completeness, conciseness, and specificity. 
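
    To make the metrics in the abstract above concrete, the sketch below computes a Flesch-Kincaid grade level (via the textstat library) and a rough lexical density for a sample explanation. The tooling and the stopword-based approximation are assumptions, not the authors' exact procedure.

      # Readability and lexical-density sketch (illustrative tooling).
      import re
      import textstat

      explanation = ("This method loops over the list of integers and returns "
                     "the largest value it has seen so far.")

      # Flesch-Kincaid grade = 0.39*(words/sentence) + 11.8*(syllables/word) - 15.59
      fk_grade = textstat.flesch_kincaid_grade(explanation)

      # Lexical density = content words / total words, here approximated by
      # counting every token outside a small stopword list as a content word.
      STOPWORDS = {"the", "a", "an", "and", "or", "of", "to", "it", "this",
                   "that", "is", "are", "has", "have", "over", "so", "far"}
      tokens = re.findall(r"[a-z']+", explanation.lower())
      content_words = [t for t in tokens if t not in STOPWORDS]
      lexical_density = len(content_words) / len(tokens)

      print(f"Flesch-Kincaid grade: {fk_grade:.1f}")
      print(f"Lexical density:      {lexical_density:.2f}")
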
  6. Worked examples, which present explained code for solving typical programming problems, are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarely have time to provide explanations for the many examples typically used in a programming class. In this paper, we assess the feasibility of using LLMs to generate code explanations for passive and active example exploration systems. To achieve this goal, we compare the code explanations generated by ChatGPT with explanations generated by both experts and students. 
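
    Purely as an illustration of how explanations like those compared above might be obtained, the sketch below asks an LLM for a line-by-line explanation of a short Java snippet. The prompt wording, model name, and use of the OpenAI Python SDK are assumptions, not the authors' setup.

      # Hypothetical explanation-generation sketch (assumed SDK, model, and prompt).
      from openai import OpenAI

      client = OpenAI()  # assumes an API key is configured in the environment

      code_example = "\n".join([
          "int total = 0;",
          "for (int i = 0; i < prices.length; i++) {",
          "    total += prices[i];",
          "}",
      ])

      prompt = ("Explain the following Java code line by line for a student in an "
                "introductory programming course:\n\n" + code_example)

      response = client.chat.completions.create(
          model="gpt-4o-mini",   # illustrative model name
          temperature=0.2,       # low temperature for more consistent wording
          messages=[{"role": "user", "content": prompt}],
      )
      print(response.choices[0].message.content)
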
  7. The ability to automatically assess learners' activities is the key to user modeling and personalization in adaptive educational systems. The work presented in this paper opens an opportunity to expand the scope of automated assessment from traditional programming problems to code comprehension tasks in which students are asked to explain the critical steps of a program. The ability to automatically assess these self-explanations offers a unique opportunity to understand the current state of student knowledge, recognize possible misconceptions, and provide feedback. Annotated datasets are needed to train Artificial Intelligence/Machine Learning approaches for the automated assessment of student explanations. To address this need, we present a novel corpus called SelfCode, which consists of 1,770 sentence pairs of student and expert self-explanations of Java code examples, along with semantic similarity judgments provided by experts. We also present a baseline automated assessment model that relies on textual features. The corpus is available at the GitHub repository (https://github.com/jeevanchaps/SelfCode). 
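
    A minimal sketch of what a textual-feature baseline of the sort described above could look like: two simple pair features (token overlap and length agreement) feeding a regressor that predicts an expert similarity judgment. The features, the Ridge regressor, and the toy training pairs are illustrative assumptions, not the paper's model.

      # Toy textual-feature baseline (illustrative; not the SelfCode baseline itself).
      import numpy as np
      from sklearn.linear_model import Ridge

      def pair_features(student, expert):
          s, e = set(student.lower().split()), set(expert.lower().split())
          overlap = len(s & e) / max(len(s | e), 1)                 # Jaccard token overlap
          length_agreement = min(len(s), len(e)) / max(len(s), len(e), 1)
          return [overlap, length_agreement]

      # Tiny made-up training pairs with expert similarity scores in [0, 1].
      pairs = [
          ("the loop adds each number", "the loop sums every element of the array", 0.8),
          ("it prints a greeting", "the method computes the factorial recursively", 0.1),
      ]
      X = np.array([pair_features(s, e) for s, e, _ in pairs])
      y = np.array([score for _, _, score in pairs])

      model = Ridge(alpha=1.0).fit(X, y)
      new_pair = pair_features("the loop adds the values", "the loop sums the array")
      print(model.predict([new_pair]))  # predicted similarity for an unseen pair

    In practice, a baseline of this kind would combine many more lexical and semantic signals and would be trained and evaluated on the full set of expert-annotated pairs.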